🤖 Reinforcement Learning - nmarshall · Scour

Catching up with RL: TUD Lecture on RL #1 ⚡Incremental Computation

sebiwette.de·1d

Karpathy with agents 🤖AI agents

breno.bearblog.dev·16h

ALTK‑Evolve: On‑the‑Job Learning for AI Agents 🤖AI agents

huggingface.co·6d·Hacker News

Aethon: A Reference-Based Replication Primitive for Constant-Time Instantiation of Stateful AI Agents ⚙️Concurrency Models

arxiv.org·1h·Hacker News

A problem with perfectly rational agents and decision theory 🧠Memory Models

alexanderpruss.blogspot.com·12h

AI Agents Are Control Systems 🤖AI agents

cloudpresser.com·2d·Hacker News

Use cases for autonomous AI agents 🤖AI agents

news.ycombinator.com·6d·Hacker News

hipvlady/agent-coherence: Arbiter — The Coherence Protocol for AI Agents 🤖AI agents

github.com·10h·Hacker News

Running AI Agents in a Sandbox 🤖AI agents

oligot.be·2d·Hacker News

Claude Code, Codex, and Pi can create their own AI agents now, and that changes everything 🤖AI agents

xda-developers.com·2d

On the usefulness of AI agents 🤖AI agents

erikjohannes.no·5d·Hacker News

Why Multi-Agent Systems Need Memory Engineering 🧠Memory Models

mongodb.com·6d·Hacker News

A composable AI agent framework in TypeScript 🤖AI agents

better-agent.com·6d·Hacker News

Show HN: We were wrong about AI capability floors (and why smart triggers matter) 🤖AI Inference

zenodo.org·5d·Hacker News

Rethinking Robotics Reinforcement Learning: A Practical Humanoid Training Workflow 🎮Game Engines

semiengineering.com·5d

The golden rules of agent-first product engineering 🤖AI agents

newsletter.posthog.com·6d·Hacker News

AI Pentesting Agents 2026: 39+ Tools, Architecture Deep Dive & Benchmark Analysis 🤖AI agents

appsecsanta.com·6d·Hacker News

Show HN: HyperFlow – A self-improving agent framework built on LangGraph 🤖AI agents

news.ycombinator.com·4d·Hacker News

Externalization in LLM Agents: A Unified Review of Memory, Skills, Protocols and Harness Engineering 🤖AI agents

arxiv.org·5d·Hacker News

Probabilistic Language Tries: A Unified Framework for Compression, Decision Policies, and Execution Reuse 🎯Hindley-Milner

arxiv.org·6d·Hacker News
